SEQUENCE LISTING 
Underlining in each sequence indicates location of the chromophore domain. 
Potential cysteine attachment sites are indicated in boldface. 

Cphl: Locus SLR0473 = S6\<)3PHY1 = SYN_PHY (Yeh et al, 277:1505-15-8 ( 1997)), a 
748 aa protein. 
SEQIDNO:l 

mattvqlsdqslrqletlaihtahliqphglvvvlqepdltisqisanctgilgrsped 
llgrtlgevfdsfqidpiqsrltAgqisslnpsklwarvmgddfvifdgvfhrnsdgl 
lvcelepaytsdnlpflgfy hma^aalnrlrqqanlrdfydviveevrrmtgfdr 
vmlyrfdennhgdviaedkrddl^pylglhypesdipqparrlfihnpirvipdvyg 
vavpltpavnpstnravdltesilr^yhchltylknmgvgasltislikdghlwgl 
iachhqtpkvipfelrkaceffgrvwsnisaqedtetfdyrv qlaeheavlldkmt 
taadfvegltnhpdrllgltgsqga^fefgeklilvgetpdekavqyllqwlenre 
vqdwftsslsqiwdavnfksvasgllaipiarhnfllwfrpevlqtvnwggdpnh 
ayeatqedgkielhprqsfdlwkeivrbqslpwqsv^iqsalalkkaivnlilrqaee 
laqlarnlersnadlkkfayiashdlqeVlnqvsnyvqllemrysealdedakdfi 
dfavtgvslmqtliddiltyakvdtqyaoltftdvqevvdkalanlkqrieesgaei 
evgsmpavmadqiqlmqvfqnliangikpYgdkspkikiwgdrqedawvfavqdn 
gigidpqfferifvifqri.htrdeykgtgmgAaickkiieghqgqiwlesnpgegstfy 
fsipign 

cph2 Locus SLL0821 (Manabe K, Nakazawa M. JPhnt Res. 110: 109-122 (1977)) 
GB:D64003 
SEQ ID NO: 2 

MNPNRSLEDFLRNVINKFHRALTLRETLQVIVEEAR^rFLGVDRVKIYICFASDGSGEVL 

AEAVNRAALPSLLGLHFPVEDIPPQAREELGNQRKAQAVDVAHRRKKSHELSGRISP 

TEHSNGHYTTVI)SCHIQYLLAMGVLSSLTWVMQDQWwGrMAVHHSKPRRFTEQ 

EWETMALLSKEVSLAITQSQLSRQV HQQQVQEALVQRkETTVAQYGDRPETWQYA 

LETVGQAVEADGAVLYIAPDLTGSVAQHYQWNLRFDWGNWLETSLWQELMRGQP 
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SAAMEPMAAVQSTWEBCPRPFTSVAPLPPTNCVPHGYTLGELEQRSDWIAPPESLSAE 
NFQSFLIVPLAAD^QWVGSLILLRKEKSLVKHWAGKRGIDRRMLPRLSFEAWEETQ 
KLVPTWNRSERKLA^VASTQLYMAITQQFVTRLITQQTAYDPLTQLPNWIIFNRQLT 
LALLDALYEGKMVGV^IAMDRFKRINESFGHKTGDGLLQEVADRLNQKLSPLAA 
YSPLLSRWHGDGFTILLTOISDNQEMIPLCERLLSTFQEPFFLQGQPIYLTASMGISTA 
PYDGETAESLLKFAEIALTRAKCQGKNTYQFYRPQDSAPMLDRLTLESDLRQALTN 
QEFVLYFQPQVALDTGKLLGVEALVRWQHPRLGQVAPDVFIPLAEELGLINHLGQW 
VLETACATHQHFFRETGRRLRMAVNISARQFQDEKWLNSVLECLKRTGMPPEDLEL 
EITESLMMEDIKGTWLLHRLRVEGVQVAroDFGTGYSSLSILKQLPIHRLKIDKSFVN 
DLLNEGADTAIIQYVTOLANGLNLETVAEGIESEAQLQRLQKMGCHLGQGYFLTRPL 
PAEAMMTYLYYPQILDFGPTPPLPJCVALPETETEAGQGNVGDRPLPNSLNRENPWTE 
KLHDWLLKERLQQRNVKE KLVLklANKIRASLNINDILYSTVTEVRQFLNTO 
FBCFNSQWSGQWTESHNDFCRSIBSTDiEIDDPCFKGHYLRLYREGRVRAVSDIEKADL 
ADCHKELLRHYQVKANLVWVVF^NLWGLLIAHECKTPRYWQEEDLQLLMELAT 
QVAIAIHQGELYEQL ETANIRLQQISSLPALTQVGNRYLFDSTLEREWQRLQRIREPL 
ALLLCDVDFFKGFNDNYGHPAGDRCLEpADAMAKVAKRPTDLVARYGGEEFAIIL 
SETSLEGAINVTEALQVEVA^AIPHTVSGTGHVTLSIGIAWTPERHTNPNALVKAA 
DLALYEAKAKGRNQWLAYEGSQLPHVDGEV 

Cph3. Locus SLL1473 (RcaE Homolog) a 297 aa histidine kinase homolog 

SEQ ID NO: 3 

mgkflipiefvflaiamtcylwhrqnqerrrielsikqqtqr erfinqitqhirqslnle 
wlnttvaevktllqvdrvliyriwqdgtgsaitesvnanypsilgrtfsdevfpvey 
hqaytkgkvransfdroqddieicladfvkqfgvi^klvvpilqhnrassldnesefp 
ylwgllithqcaftrpwqpwevelmkqlanqvaiaiqqselyeql qqlnkdlenr 
vek31tqqlaatnqslrmeiserqkteaalrhtnhtv.qsliaasprgiftlnladqiqi 
wnptaerifgwteteiiahpelltsnilledyqqfkq^vlsgmvspslelkcqkkdgs 
wiewlsaaplldseeniaglvavvaditeqkrqaeqirllqsvvvntndavviteae 
piddpgprilyvneaftkitgytaeemlgktprvlqgpVtsrteldrvrqaisqwqs 
vtveaevlndsykekksplk 



41 



cph4 Locus SLL1124 (DivJ homolog; PAS domain) A 1372 aa protein 
that is more similar to rcaE than to cphl [Wilde et al. FEBS Lett 406: 89-92 (1997). 
GB:U67397 (D90905 rfr SLL1124) 

SEQ ID NO: 4 

mtfaatprevtasaiqwaclclpgelsaaealnrwhrhgqrswepp 
aeakafppwalvldndgqllgllpdwqlaaalwtehfspaialaelclpcslrldl 
eklpslgevmqifatwgygwdvipvadrqhqtwgllsignlirsvnlcqlwqnlpl 
qvtaspplclgtettlgelvifflcferqissfpvvysspllpaaapriplgnvslsnyfk 
gpnygslgldnpigpdlsptfpActinqtycharellrrqnddyviitnisgafvgwv 
gpqqwlatvqpdvllealqrewmprivqhlearivwqqqqqqrnqhliqkllsr 
npnliylydlvkl^ivylnipgsldeggsggapipnpmvetdprqdlllppryfglee 
laalqahekkefnfeftdggqsvhWweisafeidgsgqtskilclaqdvshgkra 
eaalhtkeqqlqtlvntiadgivildfedkviyanpmacqmfglskeeflqsqlgls 
nrgqteiginvspeeegigeikavpfflwogedcrlvtvrdvtdrqrvlkklrdseqih 

RSLLEALPNLVWRLSSAGDVWECNQRTLAYFGRRGRKILGNTWQQFIEPGERENVQ 

RQWRQGIAAQEFFQLECRLWRSDGQYRWHLLQVLPLEDRFGSINGWLASSTDIDDL 

KEAEKALRNQAQQE KLLSSISQRIRESLKLETILRTTVTEVRRTIHADRVLIHHIQEDG 

LGTTIAESVVNGQPSVMQMDLSPESFPPECYQRYLNGYIYASRDQLPDCAINCAVQC 

FTVAESQSRIVAPiVFDHSLWGLLIVHQCSSSRTWQTAEIQLMQSLGNQLAIAIQQSL 

LYERL QEELSERQRAEQKLLEV^QLQKGIFDAMNYMIISTDRRGIISTFNRTAEEILGY 

TAAELIGQQTPLIFHDQEEMASEAVQLSQQLQQTIRPNSEDMFAIPAIQWGVYEREWT 

YITKTGDRLPVYVSITALRDDQGKVDGLVGVITDJLRRQKQIERERQNLDFVVKNSTE 

LIVITDLEQKVTFLNQAGQSLIGLENPETAQTTYLSEHISPEYLNFWQMEIIPQVFRSG 

AWEGEFSLQHYQTAVEIPVTASVFLLQGVNGQHPA^VAIVHDITHIKNAEKRILAA 

LEAEKELGELRSRFISTTSHEFRTPLAIISSSTGILKKYWPKLDGQRRGQHLERIEESVH 

HMVELLDDVLTINRAETKYLPFEPQPLDLVSFCRGITDELQSSTEYHGLLFSYDGLGP 

GEIVAFDPKLLRQILTNLLGNAIKYSPSGQPVEFHLQRRGDVGIFSVQDHGIGIGPEDI 

PNLFDSFYRGTNVGSIPGTGLGLPIVKKCAELHGGMITMTSQLGQGSRFEVELPLWY 

S \ 



42 



cphSXocus SLL0041 (locus 1001300) An 891 aa protein, methyl-accepting 
chemotaxis protein I.NHomology to tsr in last 250 amino acid residues. GB:D64006 Bilin 
Binding domain residues 386-550 

SEQ ID NO: 5 

maeafiaWtavedvspnpnpaidtdalaaltqsaveltppppinlpkv 
elppmqplaplmaiadpd^spmstsiqaptqsgglslrnkavlialliglipagvigg 
lnlssvdrlpwqteqqvkdsttkqirdqiligllvtavgaafvaywmvgentkaq 
talalkakhshrnldqplaVagdelaiadqtidalsaqveklrhqqdlslkqaell 
telsranlsdideiqgviqkn^qaralfgcerlvfyyhpryqpeamvvqaldlttq 
glidskdphpwgqedmpsqivapndtsgasisnphrqwleqhqvkasltvplhrdn 
yplgllmahhcqrphqwemreVofloolteelqttldranliqernesaqq aqilk 
eltlosaainseqwdiaaqeirlfe^kadrvivyrfdatwagtvivesvaegypkal 
gatiadpcfadsyvekyrsgriqat^iynagltpcmgqlkpfevkanlvapinyk 

GNLLGLLIAHQCSGPRDWHQNEIDLaGQLTVQVGLALERSDLLAQQK IAEVEQRQM 

rekmqkralellmevdpvsrgdltirVjivtedeigtiadsynatieslrrivtqvqt 
aasqftettdtnevavrqlaqqanrqaldvaealerlqamnksiqavaenaaqae 
savqratqtvdqgedamnrtvdgivair^tvaatakqvkrlgessqkiskvvnlig 
sfadqtnllalnaaieaahageegrgfavvadevrslarqsaeataeiaqlvatiq 
aetnevvnameagteqvvvgtklveetrr^lnqitavsaqisglveartsaaieqsq 
tsesvtqtmalvaqiadknsseasgvsatfkbllavaqslqeavkqfkvq 



<KBLL. 



cph6 locus SLR1212 (ETR1 homolog; PAS domain) An 844 aa protein, 
chromophore domain 461-628 

SEQ ID NO: 6 

MAITAFTLGDFFQANSYIPHGHCYLWiDTPLVWLHVSADFFTAIAYYSI 
PLTLLYFLRKRQDIPFPMIFLFSTFILCCGTSHFFDnTnWYPn^SGTVKASMAIVSII 
TWELIQrVPNALNLKSPTELATLNLALNQEIKERQTAEIALQELNNNLEKRVEDRTT 
QLAKINQQLEQEIEDKTRAKEDLEKNKDQLAQLAAIVESSQDAIISKTLDGNITSWNE 
SAERLFGYTAEEMIGSHITKLIPEELILEEDLIAECIRQG(^RINTYETQRQRKDGTKIDV 
ALTISPIRDEHK>rVVGASKIVRDITARLDVENALRESQYWEKiANYSPQILYILDPIA 
WKNIYVNYQSFEILGYTPEEFKNGGTELLLMVHPDDIPT^YENKNFWQKSQEGQVL 
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TTEYRMRHKNGSWRWLRSREVVFARDDYGQVTKVLGTAQDISDSKEQEQRLYEQ 

GRRE SLLREITQRIRQSI^LPTIFNTVVQEIRQFLEADRVVIFQFSPDSDFSVGNIVAES 

VLAPFKPIINSAIEETCFSN^AQRYQQGRIQVIEDIHQSHLRQCfflDFLARLQVRANL 

VLPLINDAILWGLLCIHQCDSSRVWEQTEIDLLKQITNQFEIAIQQATLYEQAQQ ELA 

SKNQLFVQLTNELEQKK\ai^IHHRVKNNLQIMSSLLYLQFSKASPAIQQLSEEYQ 

NRIQSMALmEQLYRSEDLANTOFSQYLKNLTHMCQSYGCNTDSIKIKLLVEQVKVP 

LEQSIPLGLIIQELVSNALKHAFRTTEGEISIKFTSMNSHYSLQVWDNGVGISRDIDLE 

NTDSLGMQLIYSLTEQLQGELHyCEYVGGAQFGLEFSL 

cph7 (locus SLR1393, a 950 aa protein chromophore domain 402-620. 
Contains a histidine kinase tranmitterqomain. 

SEQ ID NO: 7 

mspsshgtavqqaiadqiilemilqsqdlhnayrlvveglqrglgvd 
rvllvqnavfpnrqsrlvaqaiapardmilldepcadcrwlhllgqlphyglwtv 
wegegefvqldpvqgefcrtlgiksllhdplvinqrhwgvlslqylhqarpwpled 
qqfaqriahlfclglmktelwmcqnhknalqtvvaegqvqretylksaqrerai 
advidkirfaldlrslfqttvtevrkllvadrvmiikwqnknfswgeiqaeaqtdd 
klcllppkervplssrwidhfakglilamddtddqradfdqsmlalakanlvvplf 
sgdrlwgvlsvhqcdgprvwessdiefalkialnlgvalqqaelltesqrrstalq 
salgeveaqkdylariaeee raltrviegirqAelqnifratsdevrhllscdrvlv 
yrfnpdwsgefihesvaqmweplkdlqnnfplwqdtylqeneggryrnheslavg 

DVETAGFTDCHLDNLRRFEIRAFLTVPVFVGEQLWGLLGAYQNGAPRHWQAREIHL 

LHQIANQLGVAVYQAQLLARF QEQSKTMENTLADLTAIVDNLADGLLVIDLFGRITR 

YNPALLAMFDLEGLELLGAGVDAYFPETLNQLLAKPEREEQKLVTADVELSQGRQG 

QAilTSITSHENGCEYPQCLGAVIMIRDVTHEREVERMKTDFLATVSHELRTPLTSIL 

GFATVIQDKLNRVIIPELDLAQPHLGKATERVMRNLAIIESEAQRLTVLINDVLDLAK 

MEAGQESWQEQPCAIGPIIERAIATITPQAQKKNISLQGDLEPGLPDFIGDENRILQVV 

LNLLSNAVKFTPKGLITARSHFHQNYLWVEIIDHGPGIMPADQEKIFEPFQQGGGDVL 

TDKPQGTGLGLPICKKIVEHHGGTIGVNSSLGRGSTFYFSLPVPVPAVETSPAV 
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Cph8 locus SLR*969, a 750 aa protein. CHROMOPHORE DOMAIN 
(156-347). Contains a histidine kinase tranmitter domain. Adenylate cyclase homology 

SEQ ID NO: 8 

mlpafspifrrllpavVerllrfwrtlaqqtgdgvqcfvgdlpsslk 
pppgpsvleaevdhrfallvspgqwa\legeqisphhyavsitfaqgiiedfiqkqnlp 
vvaeamphrpetpsgptiaeoltlglleilnsdstsfspepslqdslqasqv kllsqvi 
aqirqsldlseil>jnavtavqkflfvdrlViyqfhysqpsltpleenqipaprprqqy 
gevt ye arrspeidtmlgimtendcfs q wkyeqkylkg avv avsdienh ys s s ycl 
vgllqryqvraklvapiivegqlwglliahq^ffiiprqwldseknflgqigehlava 
ivqsllysev qkqknnfekrviertkelrdtlRWlqaanllksqfinmshelr 
shglsatllrwfdhpaslppakqqyyllmqengkklldqinsiiqlsqlesgqtaln 
cqsfslhtlaqtvihsllgvaikqqinleldyqijA^gqdqfcadqerldqiltqllnn 
alkftpaegtvilriwkesnqaifqvedtgigineqqlpvlfeafkvagdsytsfyet 
ggvglaltkqlvelhggyievesspgqgtifttvipoqnfppttkgqvqdkldaamp 
fnssvivieqdeeiatlicelltvanyqviwlidttnalqqvellqpgliivdgdfvdv 
tevtrgikksrriskvtvfllseslssaewqalsqkgipdyllkplqpelllqrvqsiq 

QEPLR 
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